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^N| This extended abstract is dedicated to the analysis of the height of non-plane unlabelled rooted binary trees. The 

1— H height of such a tree chosen uniformly among those of size n is proved to have a limiting theta distribution, both in 

, ^ a central and local sense. Moderate as well as large deviations estimates are also derived. The proofs rely on the 

analysis (in the complex plane) of generating functions associated with trees of bounded height. 
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1 Introduction 



^^ We consider trees that are binary, non-plane, unlabelled, and rooted; that is, a tree is taken in the graph- 

r \ theoretic sense and it has nodes of (out)degree two or zero only; a special node is distinguished, the root, 

which has degree two. hi this model, the nodes are indistinguishable, while no order is assumed between 

1-^, the neighbours of a node. Let y denote the class of such trees, and let 3^„ be the subset consisting of trees 

d with n external nodes (i.e., nodes of degree zero). 

C In this extended abstract, we study the (random) height i/„ of a tree sampled uniformly from 3^„. The 

I— I depth of nodes has been analysed for many ''simple varieties" of trees by Meir and Moon [22]. Regarding 

height, a few special cases were studied early: Renyi and Szekeres [25] proved that the average height 
►^ of labelled non-plane trees of size n is asymptotic to 2%/tui\ De Bruijn, Knuth, and Rice [7] dealt with 

iy~j plane trees and showed that the average height is equivalent to y^Tm as ti — > cx3. Finally, Flajolet and 

\^ Odlyzko [10] developed an approach that encompasses all simple varieties of trees (see [12] for more 

CO results). These results are relative to trees where one can distinguish the neighbours of a node, either by 

^^ their labels (labelled trees), or by the order induced on the progeny (plane trees). 

C^ In such models with distinguishable progeny, there are natural random walks associated to the random 

^^ trees. Also, trees of a fixed size n may be seen as Galton- Watson processes conditioned on the size of 

the total progeny being n; cf [1, 17, 19]. Probabilistic techniques have been applied to the random walk 

associated to a tree traversal in order to derive asymptotic results about the trees, in particular regarding 

_ ^ height and width [4, 5]. Yet an other approach is to find the continuous limit of suitably rescaled random 

K> trees of increasing sizes. One can then read off some of the limit parameters directly on the limiting 

;^ object. This point of view has been adopted by Aldous [2] in his definition of the continuum random tree. 

^ For a recent account of the probabilistic developments, see the survey by Le Gall [20]. 

The case of trees with indistinguishable progeny is essentially different, and no direct random walk 
approach would seem to be possible, due to the symmetries inherent in unlabelled structures. The anal- 
ysis of unlabelled non-plane trees finds its origins in the works of Polya [24] and Otter [23]. However, 
these authors mostly focused on enumeration — the problem of characterizing typical parameters of these 
random trees remains largely untouched. Gittenberger [15] and recently, in an independent study that 
predates ours by a couple of weeks, Drmota-Gittenberger [8] have examined the profile of non-plane 
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unlabelled "general " trees (where all degrees are allowed) and shown that the joint distribution of the 
number of nodes at any finite number of levels converges weakly to the finite-dimensional distribution of 
Brownian excursion local times. They further extended the result to a convergence of the entire profile 
to the Brownian excursion local time. This gives in particular the limit law for the height of these trees. 
This suggests that, although there is no clear reduction of unlabelled trees to random walks, such trees 
largely behave like simply generated families. In particular, this suggests that the rescaled height Hn/^/n 
should admit a limit distribution of the theta type [10, 18, 25]. We shall prove that such is indeed the case 
for binary non-plane trees in Theorems 1 and 2 below. We also provide moderate and large deviations 
estimates (Theorems 3 and 4) as well as asymptotic estimates for the moments (Theorem 5); see §6. Some 
of the more technical proofs are omitted and we limit ourselves to the global structure of the arguments; 
the details may be found in the long version [3]. 

This note arose from questions of Jean-Frangois Marckert and Gregory Miermont [21]. Their moti- 
vation comes from an attempt at extending the probabilistic methods of Aldous to non-plane trees and 
developing corresponding continuous models — we are indebted to them for being at the origin of the 
present study. We also express our gratitude to Alexis Darrasse and Carine Pivoteau for designing and 
programming for us efficient Boltzmann samplers of binary trees and providing detailed statistical data 
that guided our first analyses of this problem. 

2 Trees and generating functions 

Tree enumeration. Our approach is entirely based on generating functions . The class 3^ of binary (non- 
plane unlabelled rooted) trees is defined to include the tree with a single external node. A tree has size n if 
it has n external nodes (hence n — 1 internal nodes). The cardinality of the subclass Xi of trees of size n 
is denoted by j/„ and the generating function (GF) of y is 



y{z) := y y^z" = z + z^ + z^ + 2z'^ + 3z^ + Qz^ + llz'^ + 23z* 



E 

^>l 



the coefficients corresponding to the entry A001190 (Wedderburn-Etherington numbers) of Sloane's On- 
line Encyclopedia of Integer Sequences. 

Since a binary tree is either an external node or a root appended to an unordered pair of two (not 
necessarily distinct) binary trees, one has the basic functional equation 

y{z) = z+-y{zf + -y{z''), (1) 

as follows from fundamental principles of combinatorial enumeration [11, 16, 23, 24]. According to the 
general principles of analytic combinatorics, we shall operate in an essential manner with properties of 
generating functions in the complex plane. The following lemma is classical: 

Lemma 1 Let p be the radius of convergence ofy{z). Then, one has 1/4 < p < 1/2, and p is determined 
implicitly by 2p + yip^) = 1. As z —f p^, the generating function y{z) satisfies 

y{z)^l-\^l-z/p+0{l~z/p), A=v/2p+2p2y'(p2). (2) 

Furthermore, the number ?/„ of trees of size n satisfies asymptotically 

yn^:^-n-'/'p-{l + 0{l/n)). (3) 
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Proof: Let C„_i — ^ (^n-i) ^^ ^^ number of plane binary trees of size n. One has combinatorially 
C„_i2~" < j/„ < C„. The C„, which are the Catalan numbers and they are well-known to grow as 
^r!j^-3/2 jjjg bounds 1/4 < p < 1/2 result. It follows that y{z^) is analytic in a disc properly containing 
1^1 < P- Then, from (1), upon solving for y{z), we obtain 



y{z) = 1 - v/l-2z-y(z2), (4) 

which becomes singular when the argument of the square root vanishes. The value p is then the positive 
solution of 2z + y{z'^) = 1 and, at this point, we must have y{p) = 1. This reasoning also justifies the 
singular expansion (2), seen to be vaUd in a A-domain extending beyond the disc of convergence \z\ < p. 
Equation (3) constitutes Otter's celebrated estimate: it results from translating the square-root singular- 
ity of y{z) by means of either Darboux's method or singularity analysis [11, 16, 23, 24]. D 

Numerically, one finds [9, 11, 23]: 

p = 0.40269 750367, A = 1.1300337163, ^ = 0.3187766259. 

2V7r 

Height. Let y/j „ be the number of trees of size n and height at most h. (Height is measured as the 
maximum number of edges along branches from the root to external nodes.) Let yh{z) — X)n>i yh,nz'^. 
The arguments leading to (1) yield the fundamental recurrence 

yh+i{z) = z+ -yh{zY + -ytiz"^),, and yo{z) = z. (5) 

We also set eh{z) = X]n>i ^h,nz'^ := y{z) — yh{z), which is the generating function of trees with height 
exceeding h. Then, a trite calculation shows that the eh{z) satisfy the main recurrence 

eh{z)\ 1 , 2n 



eh+i{z) =y{z)eh{z)n- ——] + -eh{z ), and eo{z) = y{z) - z, (6) 

on which our subsequent treatment of height is entirely based. 
Analysis. The distribution of height is accessible by 

m r tr ^1,1 Vn ~ yh,n &h,n ,_, 

r{Hn>h] = = ^^, (7) 

Vn Vn 

where eh,n = [z"]e/i(z). We shall get a handle on its asymptotic properties by means of Cauchy's 
coefficient formula, 

1 f dz 

^''■"=2z^y/''(")^^' ^^^ 

upon choosing a suitable integration contour 7 in (8). This task necessitates first developping suitable 
estimates of eh{z), for values of z both inside and outside of the disc of convergence \z\ < p. Precisely, 
we shall need estimates valid in a "tube" around an arc of the circle \z\ = p, 

'^{l^,v) -^{z- -Ai < |z| -/3< Ai, |arg(z)| > 77}. (9) 
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sandclock 




Fig. 1. The "tube" and "sandclock" regions on the left, and the Hankel contour used to estimate eh,n on the right, 
as well as inside a "sandclock" anchored at p 

S{ro,eo) -.^ {z : |z - p| < ro, 7r/2 - ^o < I arg(z - p)| < 7r/2 + ^o}- (10) 

(See Figure 1 for a rendering). 

Plan. Estimates of the sequence of generating functions {eh{z)) within the disc of convergence and a 
tube, where z stays away from the singularity p form the subject of Section 3. The bulk of the technical 
work is relative to the sandclock, in Section 4. We then develop our main approximation in Section 5. 
Sections 3-5 closely follow the general strategy of the original paper [10]; however, nontrivial adaptations 
are needed, due to the presence of Polya terms, so that the problem is no longer of a "pure" iteration type, 
as in [10]. We finally reap the crop in Section 6, where our main theorems relative to the distribution of 
the height are stated and proved (these somewhat parallel the local limit law of [12] in the planar case). 

3 Convergence away from the singularity 

Our aim in this section^'^ is to extend the domain where e^ is analytic beyond the disc of convergence 
\z\ < p, when z stays in a "tube" T{ii,rj) as defined in (9) and is thus away from p. The main result 
is summarized by Proposition 1, at the end of this section. Its proof relies on the combination of two 
ingredients: first, the fact, expressed by Lemma 2, that e^ — > (equivalently: yh —^ y) in the closed disc 
of radius p; second, a general criterion for convergence of the ei^ to 0, which is expressed by Lemma 3. 
The criterion implies in essence that the convergence domain is an open set, and this fact provides the 
basic analytic continuation of the generating functions of interest. 

Lemma 2 For all z such that \z\ < p, and h > 1, one has \eh{z)\ < {\z\/p) jvh. 

Proof: To have height at least h, a tree needs at least /i + 1 nodes, so that |e/i(z)| < 'Ylin>h Vn\z\^ ■ We 



first note an easy numerical refinement of (3), namely, j/„ < },p 



-3/2 



(See [14] for a detailed proof 



strategy in the case of a similar but harder problem.) The claim follows by bounding the sum using an 
integral. D 



''* In what follows, we freely omit the arguments of j/, e^, . . . , whenever they are taken at z. 
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We now devise a criterion for the convergence of the eh to 0. This criterion, adapted from [10, 
Lemma 1], is crucial in obtaining extended convergence regions, both near the circle \z\ = p (in this 
section) and near the singularity p (in Section 4). 

Lemma 3 (Convergence criterion) Let z e V :— {z : \y\ < 1} and assume that \z\ < y/p. The 
sequence {|e;i(z)|, h > 0} converges to if and only if there exist an integer m and real numbers a, P G 
(0,1), such that the following three conditions are simultaneously met: 



< a, 



\y\+a/2<P, a/3+(|z|Vp) < a. (11) 

Furthermore, // (11) holds then, for some C and /3q E (0, 1), one has |e;i| < Ch/3Q,for all h > m. 

Proof: (i) Since |y| < 1 in I? and z < y/p, the convergence e^^ ^ clearly implies that (11) holds. 

(ii) Conversely, assume the three conditions in (11), for some value m. Then, they also hold for m + 1. 
Indeed, recalling (6), we see that, for any h >0, 



\eh+i\ < \eh\[\y\ + 



2 



\eHiz')\ 



< \eh\[\y\ + 



2 



(12) 



where the Polya term involving |e/i(z^) | has been bounded using Lemma 2. Using the hypotheses of (1 1) 

together with (12) above taken ath — m, yields |e„i+i | < a. By induction, (11) then holds for all h > m. 

(Hi) The assertion that je^,! < Chp^ follows by expanding (6) and using /3o — max{/3, jzp/p}. □ 

We can now state the main convergence result of this section: 

Proposition 1 (Convergence in "tubes") For any angle ry > 0, there exists a tube T{p,, rj) with width 
p > 0, such that |e/i(z)| -^ 0, as h —> oo, whenever \z\ lies in T(p, rj). 

Proof sketch: If we exclude a small sector of opening angle 2ri around the positive real axis, then 
the quantity, Ao := sup{ |2/(z)|; 1^1 — p, \ arg(z)| > f]} , satisfies Ag < 1. The continuity of y and 
Lemma 3 then imply the convergence of e/i(z) to in a small disc around each z E {pe'^ • 1^1 — v}- Ths 
latter set being compact, we can extract a finite covering, which must then contain the desired tube where 
the convergence holds. □ 



4 Convergence near the singularity 

We now focus on the behaviour of e/i(z) in a "sandclock" around the singularity. When z approaches 
p, \y\ is no longer bounded away from 1 and the criterion for convergence given by Lemma 3 cannot be 
used directly. However, as we prove below, the quantities |e/i(2)| first exhibits a decreasing behaviour 
for h < N, for some appropriate TV — N{z). At that point, |eAr(2:)| is small enough for the criterion of 
Lemma 3 to be satisfied, whence enventually the convergence |e/i(2:)| ^ as /i ^ cx) in a sandclock. 
The upper bound we use for |eAr| is based on the following alternative recurrence. 

Lemma 4 (Alternative recurrence) Ifet ^ and e^ [l — ei(z^)/ef ] ^ 2yfor i ~ 0, . . . ,h—l then the 
following recurrence relations hold 



(13) 



eh 


_ 1 1 1 1-/ 'y^y^-'e.iz^) ^"^y^-'e, 
eo 2y 1-y ^ 2e^ jr{ ^ 


1- 




1 - 
V 


"2y 


1 - 


e^{z^)^ 
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Proof: The proof relies on a classical idea in the study of slowly convergent iterations [6, 10]. Starting 
with the recurrence relation (6), rewritten as 



e-i+i 



y 



.i+i 






1 



e^iz^) 



the trick is to take inverses. This is a classical technique in the study of slowly convergent iterations (near 
an indifferent fixed point): see for instance [6, page 152]. Using the identity (1 — m)^^ = l + w + u^(l — 
u)^^ shows that 



y 



i+i 



Ci+l 



y 



i-i r 



e^{z') 



2\1 



2y 



y 



i-2 r 



e^{z^) 



n 2 r 



e,(z2) 



2M 



Summing the terms of this equality for i = 0, . . . ,h — I yields the claim. 



D 



An important step of the proof of Proposition 2 consists in controlling the behavior the terms in the 
alternative recurrence for all h < N (we will fix N later). In particular, we need good enough upper and 
lower bounds for |eft(z)|, h < N, and for z around p and p^. Obtaining such estimates requires to study 
the recurrence relation (6) more carefully and to quantify the effect of the analytic Poly a term eh{z'^) for z 
in the "sandclock"5(ro,6'o). The following lemma is evocativeof the theory of iteration near an attractive 
fixed point and gives some estimates on the behaviour of e^ in the interior of the disc of convergence. 

Lemma 5 There exist constants Ki , K2 > and r > 0, such that, for all h > 0, and for \z — p'^\ < r one 
has eh{z) = Ch{z) ■y{z)'^, where Ch{z) = {C{z) + o{l))y{z)''' and C{z) is analytic at p^ . Furthermore, 
Ki < \Ch{z)\ < K2 and a.rg{eh{z)) < co{h+l)r. 

With Lemma 5 in hand, we can obtain a first set of properties of e^iz) for z G S{rQ, 9o) and h not too 
large (depending on z). These will be used to derive an upper bound on |eAr| and prove that e^ satisfies 
the criterion of Lemma 3. In the following, we only need to consider z e S{ro, Oq), with 3(z) > 0, since 
we clearly have e/i(z) = eh{z), where z denotes the complex conjugate of z. 

Lemma 6 (The initial behavior of I e^ I) Suppose '^{z) > 0. Let N{z) — [arccos(l/4)/arg?/(z)J. 

There exist constants rg > and 6q > Q such that if z lies in the sandclock S(rQ,6o) then, for 
l<h< N{z): 



\y 



h+l 



<\eh{z)\<l/2 and < arg(e;,+i) < (/i + 1) arg(?/). 



2{h+l) 
Furthermore, one has |e;i(z)| < 1/5, /or 6 < h < N(z) 



(14) 



Proof: {i) We first focus on the proof of the upper bound in (14). Consider the recurrence relation (6) 
rewritten as 

eh/y\ , e/i(z^) 



e/i+i/y = yeh/y[ 1 - 



2y 



(15) 



The behavior of the first term in (15) is dictated by properties of g : w 1-^ w{l — w/2). A very similar 
function appeared in the analysis of Flajolet and Odlyzko [10, Lemma 3]. By a simple modification of the 
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proof in [10], we have 

I kl<i ^ f lffHI<kl (16) 

|_ < argui < arccos(l/4) I ^ < 8.y:gg{w) < argw. 

We use (16) and an induction argument to prove that, for < h < N{z), 

\eh\<l/2 and arge^ < (ft. + 1) arg y. (17) 

Write z — p + re**. Then, by Lemma 1, we have eo/y = 1 — z/y so that |eo/j/| < 1 — p + 0{^) and 
< a.ig{eo/y) = 0{y/r). In particular, for r small enough, (17) holds for h = 0. 

Suppose now that (17) holds for all integers up to h. To determine whether it also holds for h + 1, 
we have to take the second term in the right-hand side of (15) into account. For tq small enough and 
z G S{rQ, 7r/8), Lemma 5 ensures that this second term cannot contribute any increase in the argument of 
Cfi/y- Therefore, 

arg(e/i+i/y) = a.rg{eh/y) + arg(y) < {h + 2) arg(y). 

Furthermore, since y is analytic in V, we have je^l = |ei(/9)| + 0{y/r) as r ^ 0, for all fixed z > 0. So 
for ro small enough and z e S{ro, vr/S), |eft+i(2)| < 1/2 if h < 5. Now, if ft > 5, 

^ h-\-l ^ oo 

H+i/y\ < \e,{z)/y\ + --Y,\e,{z')/y\ < \e,{p)\ + 0{V^) + - -^^ip + 3r)\ 

i—6 i—Q 

since ei(z^) < (|zp/p)' by Lemma 2. One can then verify that, for ft > 6, \eh+i/y\ < 1/5, for vq small 
enough. So, a fortiori, among the conditions in (16), the one on the modulus holds as long as that on the 
argument does. The latter one holds for all h < N. 

(ii) It remains to prove the lower bound in (14); we start with (15). For ft < N{z), the additional 
Polya term e/i(z^) only contributes to making \eh+i \ larger: for z £ S{ro, 9q), by Lemma 5 and the upper 
bound we have just proved, the arguments of both terms are such that, for ft < N{z), 

|e,+i/y| > |y| • K/y\ ■ (l ^ 

Since x i-^ x{l — x/2) is increasing in [0, 1], we have, for all ft > 0, le^/j/j > //i, where the sequence 
{fh)h>o is defined by fh+i = \y\ ■ A • (1 - fh/2) and /o = |eo|/|2/|- The latter recurrence relation has 
been analysed by Flajolet and Odlyzko [10] in the case of simply generated trees; the bound follows. D 

It is then easy to show that, as expected, the Polya terms are essentially negligible: 

Lemma 7 There exist tq > and ^o > small enough such that, for z G 5(ro, ^o) and for all h < N, 

one has |e^(z2)|/|e^(z)2| < min {l2 • (2p)'^+i, i} . 

Finally, we prove the main result of this section. The proof follows the lines of the analogous statement 
[10, Proposition 4], where the iteration is "pure", but now needs to control the effect of the Polya terms, 
using Lemma 7. 

Proposition 2 (Convergence in a "sandclock" around p) There exist constants tq and 9q such that the 
sequence {eh{z), ft > 0} converges to zero for z £ S{ro, 9o), where the "sandclock" S{ro, Oq) is defined 
by (10). 



8 
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Proof: We prove that for h = N = N{z) defined in Lemma 6, ejv satisfies the convergence criterion of 
Lemma 3. For this purpose, we use the ahernative recurrence stated in Lemma 4 



eh 



1 1-/ 
22/1-2/ 



1 
eo 



/i-i 



-^^^ y' ^ei{z^) 



1=0 



2e? 



i=l 



1 



c2 



1 



2y 



1 



^2 



(18) 
and devise an asymptotic lower bound for the right-hand side. First observe that we can indeed use the 
relation, since by Lemmas 6 and 7, for alH = 0, . . . , A^, the denominators do not vanish. 

Write 1 — y{z) = ee**. Obtaining the claim reduces to proving properties for small e > and t close 
to — 7r/4. The following expansions are valid uniformly for t e [— 7r/4 — S, —tt/4 + S] with < 6 < tt/A 
when e -^ 0: 



1-M 

arg(?/) 



ecost + 0(e2), 
-esini + 0(e^), 



and 



N{z) 



-(p/{esmt) + 0{l) 



where ip : 



l-|y|^ = l-e'^-™** + 0(e), 
arccos(l/4). The first term of the right-hand side of (18) brings the main contribution: 



(19) 



1 1-2/ 



N 



2y 1-2/ 



1 



II 



y 



N\ 



11-2/1 



1 1 
> - 



12/ 



N 



1 



oC/i'-COt t 



2 11-2/1 



2e 



0(1), 



as e ^ 0. On the other hand, we have I^atI = 0(1) for, by Lemma 7, the summands decrease geo- 
metrically. The second error term B]y appearing in (18) can be bounded by splitting the sum into two at 
h = K. Let i^ > 0. By Lemma 7, there is K > 6 large enough that for all i satisfying K < i < N, 
\ei{z'^)\/\ei{z)'^\ < V. Then, by Lemma 6, and for e small enough, we have 



\B 



N 



< 



K/2 



(3/2)^ 



4 1 - ^ • (3/2) 
upon choosing ly = 1/100. It follows that 



1/5 
4 



(1 + ^)2 



1 



12/ 



N 



Nl 



12/ 
\eN\ 



> 



1 — gV-cott / ^ 



1/5 
2-2e 



3 

50 cos t 



(1 + ^) 



+ 0(1)>- 



< 


K 

~2 


+ 


3 


1- 


-e^P 


•cott 




e 




•) 



1-12/ 
1-12/1 



Af 



(20) 



(21) 



for all z G I? such that e < eg and \t — 7r/4| < 5q, as soon as both eo and 5q are small enough. 

We can now focus on the criterion for convergence (Lemma 3) and the conditions in (1 1) for to = TV. 
From (19) and (21) we have f or e > small enough. 



4 1 - e'^-'^ 



A simple verification shows that the coefficient of e above is at most —1/4 for all t close enough to — 7r/4. 
Thus, for all e > small enough, 

^ip cot t 



ejv < 



e • e'^ 



2 1 - e*^-' 



and 



12/1 



|ejv| 
2 



< 1 



/3. 



(22) 



Also, by (19), we have (|zp/p)^ = o(a(l — /3)). Hence the criterion for convergence of Lemma 3 is 
satisfied with the values of a and (3 specified in (22), as soon as e is small enough. D 
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5 Main approximation 

In this section, we estimate e/i(z) around the singularity. 

Proposition 3 (Main estimate for Ch in a sandclock around p) There exist n , 9i and K, K' such that 
for all z e S(r\, B\) and all h > 1, 

y'' 1 1 - y'* (I 

- ' Rh{z) where \Rh{z)\ < Kmin {log- — ,log(l + /i) 



eh 2 1-y . V /.- ^ o^ 

Furthermore, |i?/j — -R^+il l£ K'/h. 

The main idea is to obtain a better control on the error term using bounds extending those obtained in 
Section 4 for /i > N, knowing the a priori information that Ch converges. The proof of Proposition 3 also 
requires the bounds to be uniform in the distance to the singularity jz — jo| and in h. 

Lemma 8 (Uniform lower bound for |e/i|) ForanyS G (0,1), there exist constants ri,9i > such that 
ifz&S{ri,di), then for all h> 0, one has \eh{z)\ > (1 - 5)''+V(2(/i + 1)). 

Proof: Let S G (0, 1). For r small enough, we have |y| > 1 — d/2 provided r :— \z ~ p\ < tq. Then, by 
Lemma 6, the desired lower bound is clear for h < N. So we now assume that h > N. The recurrence 
relation (6) imphes that 

By Lemma 3, \eh\ is decreasing for h > N, when the criterion is satisfied. So, for z E S{ro, Oq) as in 
Proposition 2, we have 

\eN\\ ,„ , \eh{z')\ 



However, we have set N in such a way that |y| + |eAr|/2 < 1, and by Lemma 2, we have le/i+ij > 
(1 — i5)|e;j| — {p + r)^. Routine verifications then show, for r small enough, that 1 — 5 > p + r, the 
negative contribution decreases fast enough so that \eh\ remains bounded from below as desired. □ 

We can now proceed with the proof of a uniform upper bound for | Ch \ when z e S{ri^Oi). 

Lemma 9 (Uniform upper bound for 1 6^(^)1 around p) There exist constants ci, ri and 6i such that, 
for any h > I, and z G S{ri, 6i), we have |e/i(z)| < Ci/h. 

Proof: Write 1 — y = ee'* for some e > and t. It suffices to prove that the result holds for all such z 
provided e is small enough and t close enough to — 7r/4. We use again (18). Then, we have. 



1 1-/ 



2y 1-2/ 



h\ 1 L.l'i 1 L,l^ 



1 ll-y-^l l-|y|" l-\y 



> wrv-^ - -^r^- (23) 



2\y\ \l-y\ - 2\l~y\ 2e 



The error terms Ah and Bh are bounded as in the proof of Proposition 2 and we obtain, for all /i > and 
e > small enough, \Ah\ < Ki and |B,J < K2 + \{l - |y|'')/(l - |2/|)'"'- One then sees that, for all 

'"* In what follows, we use generically ii", /Ci , . . . to denote positive absolute constants, not necessarily of the same value at different 
occurrences. 
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h> 0, and all t close enough to — 7r/4, 

For h not too large, \y\^ decreases at least linearly in h, and one can show that \y\'^ < 1 — She, for some 
small (5 > and all < /i < iV as long as |t + 7r/4| is small enough. Equation (24) then implies that 
HI < Ki/h, for all h <N. 

On the other hand, by (19), if ft. > N then the factor |y|'', ensures the desired decreasing behaviour 
Indeed, fore small enough and i close enough to —7r/4, one has |y I < (1 — e/2) and |e;i| < Kc,e{l — e/2)'^. 
The maximum of the right-hand side above is obtained for e — 2/{h + 1), which implies that je^j < 
2Kz/{h + 1) for ft > TV. This completes the proof. D 

Proof of Proposition 3: The proof consists in using Lemma 9 above to bound the error terms in (18) for 
z e 5(r2, 62), with r2 = min{ro, ri} and O2 = minj^Oj ^i}- For some constants C2 and C3, we have 

Also, since A^ and By^ are partial sums, i?/j — Rh+i only contains one summand, which is easily seen to 
be 0(l/ft) uniformly by Lemmas 8 and 9. □ 

6 Asymptotic analysis and distribution estimates 

The basis of our estimates relative to the distribution of height is Proposition 3 in conjunction with 
Cauchy's coefficient formula (8) where 7 is a contour comprised of the arc of an outer circle of radius 
larger than p (and interior to the cardioid-shaped region, where \y\ < 1) and a set of two connecting 
segments passing through the singularity p (Figure 1). In addition, it proves useful, in order to garantee 
well-defined determinations of square roots, to think of the two segments as in fact joined by an infinites- 
imal arc of a circle that passes to the left of the singularity p. The strategy just described belongs to the 
general orbit of singularity analysis methods. 

Theorem 1 (Limit law of height) The height iJ„ of a random tree taken uniformly from 3^„ admits a 
limiting theta distribution: for any fixed a; > 0, there holds 




lim P(iJ„ > x^n) = y (fc^A^x^ - 2)e-'= ^ " /^ A := V2p + Vy'(p')- 



fc>i 



Proof: The integration contour in Cauchy's formula (8) is 7 = 71 U 72 U 73. There, 71 is the segment 
lying in the half-plane 3(2;) > 0, 72 is the complex-conjugate image of 71, and 73 is the outer circular 
arc. For the radius of the latter circle, we adopt r„ = p(l + log n/n) . We assume that 73 lies in a legal 
tube (granted by Proposition 1) and that 71 and 72 are in an overlapping sandclock such that Proposition 3 
applies. We set, for some ^i > : 71 = 72 = {p(l + ^e^'^/^+'^i) : a; G [0, (5„]} , with (5„ such that the 
rectilinear portions 7i and 72 connect with the outer circle 73. So, we have (5„ ~ log^ ^/(nsin^i). 
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Outer circular arc (73). By Proposition 1, we have e^(z) -^ uniformly on 73 as /i -^ 00. In 
particular, all moduli |e/i(z)| are bounded by an absolute constant K. On the other hand the Cauchy 
kernel z^" is small on the outer part of the contour, so that 



dz 
efiiz) — rrr 



< Kip "cxp(-log^n) 



(25) 



This contribution is thus exponentially small compared to y„. 

Rectilinear parts (71, 72)- Our objective is to replace Ch by the simpler quantity 

1 — w 

eh{z) = eh := 2- j-y^, 

1 -y" 



(26) 



as suggested by Proposition 3. Along 71,72, the singular expansion of y{z) applies, so that 1 — y = 
0((logn)/n^/^) and the error term Rh{z) is O(logn). There results that (1 — y'^)/{\ — y) is always at 



least as large in modulus as K2\Jnl log n (by a study of the variation of 1 1 — 



,-/ir| 



have 



/ y^ '^i + o (^^^"^ 



eh eh \ \ V^ 

It proves convenient to define the following approximation for eh,n'- 



|1 — e "^1), and we 



(27) 



E{h,n):^- 



1 



2i7T 



eh 



71U72 



dz 



(28) 



and effect the change of variables z — p (l — ^). The quantity t then varies from —inSnC^^^^, loops to 
the right of the origin, then moves away to inSne^^^ . With the singular expansion of y{z) as in (2), we 
have on 71,72, 



z-'" = p-"e* (1 + (log'* n/n)) , y{z) = 1 - Xy/tJ^ + O {t/n) 

and, with h = x^/n and \t\ < K2 log n: 

y^ = expi^XxVi) • (l + O (log^ n/V^)) . 
We also find^"'^, for the range of values of t corresponding to 71 , 72: 



1-y 



1 — exp(— Aa;\/t) 



1 + 



log* n 



(29) 



(30) 



(31) 



The approximations (29), (30), and (31) motivate considering, as an approximation of E{h, n) in (28), 
the contour integral 



J{X) 



1 /■ exp(-X\/t) 



2«7r Jc 1 - cxp(-XV^) 



y/te*dt= V / exp(~kXVi)Vie*dt, (32) 

k>l-"~ 



'"'* The expression log* n represents an unspecified positive power of log n. 
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where C is the image of 71 U 72 in the change of variable. We can now make J{X) explicit. Each integral 
on the right side can be evaluated by the change of variables w = i\J~i, t = —w^. By completing the 
square and flattening the image contour C onto the real line, we obtain: 

j(X)^^^e-'='^'/4(fc2x2-2). (33) 

^''^ k>l 

Error management. It can be checked that the replacements: en i~+ e/i, y ^-^ 1 — X^JtJn, and y^ 1-^ 
exp(— AxVt) only entail error terms of order p~"n~^ log"^ n, which implies, for h — x^Jn, for h — x^n: 

eh,n = 2Ap-"n-3/2 j(Ax) + O (p"" log* n/n^) . 

The explicit form of J{X) in (33) and the asymptotic form of y„ (Lemma 1), yield the statement. □ 

Theorem 2 (Local limit law of height) The height iJ„ of a random tree taken uniformly from 3^„ admits 
a local limiting distribution: for x in a compact set q/M>o and h — x^/n an integer, there holds 

P(F„ ^h)^ ^P— Y^ik^X^x^ - &eX'x^)e-''"^"^"'\ 
■^'^"' fc>i 

Proof: Proceeding like in the proof of Theorem 1, we can justify approximating the number of trees of 
height exactly h by the integral 

— j^ {en - e.+O ^^, where e. - e.+i = 2y ^^ _ ^.^^^ _ ^.^.y 

The approximations (30) and (31) then motivate consideration of the integral 

'^ ' 2nrJc{l-cxp{-XVt)y 
and one finds (with the auxiliary estimate Rh — Rh+i — 0{{log^ n)/y^) provided by Proposition 3): 

2/„./^ - yn,h+i = 2A2p— n-2 Ji(Ax) + O ip^'^^j? 

On the other hand, differentiation under the integral sign yields Ji{X) = —J'{X), which proves the 
statement. □ 

Revisiting the proof of Theorems 1 and 2 shows that one can allow x to become either small or large, 
albeit to a limited extent. Indeed, it can be checked, for instance, that allowing x to get as large as 
0{\/\ogn) only introduces extra powers of log n in error estimates. However, such extensions are limited 
by the fact that the main theta term eventually becomes smaller than the error term. We state (compare 
with [12, Th. 1.1]): 
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Theorem 3 (Moderate deviations) There exist constants A,B,C > such that for h = x^n with 
Aj \J\o% n < X < A^log n and n large enough, there holds 



fe>i 



< ^- (34) 



n^' 



In particular, (/a; — > oo in such a way that x < A\/log n, then, uniformly. 

Similar estimates hold for the local law. 

These estimates can be supplemented by (very) large deviation estimates in the style of [12, Th.1.4]: it 
suffices to make use of the fact that ej^ is bounded from above by a large power and optimize on r e (0, p) 
the saddle-point bound 

eh,n<^^, 0<r<p. 

The probability of a linearly height is then exponentially small: 

Theorem 4 (Very large deviations) There exists a continuous decreasing function /(^) satisfying /(^) > 
Ofor < ^ < 1, such that, given any fixed 6 > 0, one has far h — ^n, and (5 < ^ < 1 — i5, 

where K depends on S. 

Finally, the approximation of e/j by e^ in (26) is good enough to grant us access to moments (cf also [10]). 
The problem reduces to estimating generating functions of the form 

M.(z) = 2(l-,f^/^'--^, 

h>l ^ ^ ' 

which are accessible to the Mellin transform technology [13], upon setting y = e~'^. 
Theorem 5 (Moments of height) Let r > 1. The rth moment of the height Hn satisfies 

E [Hn] - I V^ and E[ij;] - r{r - l)C(r)r(r/2) fjj v!'!'^, r > 2. 



A 
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